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In the last three decades a large number of compiler transformations for optimizing programs have 
been implemented. Most optimizations for uniprocessors reduce the number of instructions 
executed by the program using transformations based on the analysis of scalar quantities and data- 
flow techniques. In contrast, optimizations for high-performance superscalar, vector, and parallel 
processors maximize parallelism and memory locality with transformations that rely on tracking 
the properties o ... 



Keywords: compilation, dependence analysis, locality, multiprocessors, optimization, parallelism, 
superscalar processors, vectorization 



2 Lattice fr^ 

Stephen P. Masticola, Thomas J. Marlowe, Barbara G. Ryder 

September 1995 ACM Transactions on Programming Languages and Systems (TOPLAS), 
Volume 17 Issue 5 

Full text available: Hpdf(1.66 Additional Information: full citation, abstract; references , citings, 
MB) index terms , review 

Multisource data flow problems involve information which may enter nodes independently 
through different classes of edges. In some cases, dissimilar meet operations appear to be used for 
different types of nodes. These problems include bidirectional and flow-sensitive problems as well 
as many static analyses of concurrent programs with synchronization. K-tuple frameworks, a type 
of standard data flow framework, provide a na ... 

Keywords: data flow analysis, lattice frameworks 



http://portal.acm.org/results.cfm?CFro=52012114&CFTOKEN=48557289&adv=l&COLL= 



9/1/05 



Results (page 1): +bitwise +constant +propagation +precision operator operation Page 2 of 7 



3 Query evaluation techniques for large databases 
Goetz Graefe 

June 1993 ACM Computing Surveys (CSUR), Volume 25 Issue 2 

Full text available: ffij2df(9.37. Additional Information: ML citation, abstract, references, citings, 
MB) i&dS2Ltens& review 

Database management systems will continue to manage large data volumes. Thus, efficient 
algorithms for accessing and manipulating large sets and sequences will be required to provide 
acceptable performance. The advent of object-oriented and extensible database systems will not 
solve this problem. On the contrary, modern data models exacerbate the problem: In order to 
manipulate large sets of complex objects as efficiently as today's database systems manipulate 
simple records, query -processi ... 

Keywords: complex query evaluation plans, dynamic query evaluation plans, extensible database 
systems, iterators, object-oriented database systems, operator model of parallelization, parallel 
algorithms, relational database systems, set-matching algorithms, sort-hash duality 



4 C and tec: a language and compiler for dynamic code generation 

Massimiliano Poletto, Wilson C. Hsieh, Dawson R. Engler, M. Frans Kaashoek 
March 1999 ACM Transactions on Programming Languages and Systems (TOPLAS), Volume 
21 Issue 2 

Full text available: 1ll pdf(471 ,68 Additional Information: full citation, abstract, references, citings . 
KB). index^tenns, review 

Dynamic code generation allows programmers to use run-time information in order to achieve 
performance and expressiveness superior to those of static code. The 'C(Tick C) language is a 
superset of ANSI C that supports efficient and high-level use of dynamic code generation. 'C 
provides dynamic code generation at the level of C expressions and statements and supports the 
composition of dynamic code at run time. These features enable programmers to add dynamic 
code generation ... 

Keywords: ANSI C, compilers, dynamic code generation, dynamic code optimization 
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January 2000 ACM Transactions on Software Engineering and Methodology (TOSEM), 
Volume 9 Issue 1 

Full text available: S.pdf(4Q0.,17 Additional Information: MLcitation, abstract, references, citings, 
KB) index terms 

There has been a surge of progress in automated verification methods based on state exploration. 
In areas like hardware design, these technologies are rapidly augmenting key phases of testing and 
validation. To date, one of the most successful of these methods has been symbolic model- 
. checking, in which large finite-state machines are encoded into compact data structures such as 
Binary Decision Diagrams (BDDs), and are then checked for safety and liveness properties. 
However, these technique ... 
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This paper describes how the cmcc compiler reuses code — both internally (reuse between difFeren 
modules) and externally (reuse between versions for different target machines). The key to reuse 
are the application frameworks developed for global data-flow analysis, code generation, 
instruction scheduling, and register allocation. The code produced by cmcc is as good as the code 
produced by the native compilers for the MIPS and SPARC, although significantly less resources 
have been spent on cmcc ... 
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Alistair Moffat, Radford M. Neal, Ian H. Witten 

July 1998 ACM Transactions on Information Systems (TOIS), Volume 16 Issue 3 
Full text available: ®.pdf(487„.26 Additional Information: follcitation, abstract, references, citings, 
KB) index terms 

Over the last decade, arithmetic coding has emerged as an important compression tool. It is now 
the method of choice for adaptive coding on myltisymbol alphabets because of its speed, low 
storage requirements, and effectiveness of compression. This article describes a new 
implementation of arithmetic coding that incorporates several improvements over a widely used 
earlier version by Witten, Neal, and Geary, which has become a de facto standard. These 
improvements include f . . . 

Keywords: approximate coding, arithmetic coding, text compression, word-based model 
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July 1992 ACM Transactions on Modeling and Computer Simulation (TOMACS), Volume 2 
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Full text available: Hpdf(1.23 Additional Information: full citation, abstract, references , citings , 
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Low-discrepancy sequences are used for numerical integration, in simulation, and in related 
applications. Techniques for producing such sequences have been proposed by, among others, 
Hal ton, Sobol', Faure, and Niederreiter. Niederreiter' s sequences have the best theoretical 
asymptotic properties. The paper describes two ways to implement the latter sequences on a 
computer and discusses the results obtained in various practical tests on particular integrals. 

Keywords: Niederreiter sequences, Quasi -Monte Carlo methods, low-discrepancy sequences, 
quasirandom sequences 
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Essential Language el(α) is developed as a "Reduced Expression Set Language" which is 
simple and yet satisfies conditions for efficient programming and efficient execution. The 
grammar of el(α) is determined by unifying a machine model applicable to almost all 
computers and a language model with features essential for system programming. Much efforts 
are exerted to make the language machine independent and to eliminate execution-time overhead. 
Its simple syntax drawn in 2 dia ... 
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Mary L. Bailey, Jack V. Briner, Roger D. Chamberlain 

September 1994 ACM Computing Surveys (CSUR), Volume 26 Issue 3 

Full text available: HI pelf (3. 74 Additional Information: full citation, abstract, references, citings , 
MB) index terms 

Fast, efficient logic simulators are an essential tool in modern VLSI system design. Logic 
simulation is used extensively for design verification prior to fabrication, and as VLSI systems 
grow in size, the execution time required by simulation is becoming more and more significant. 
Faster logic simulators will have an appreciable economic impact, speeding time to market while 
ensuring more thorough system design testing. One approach to this problem is to utilize parallel 
processing, taking ... 

Keywords: circuit structure, parallel architecture, parallelism, partitioning, synchronization 
algorithm, timing granularity 
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August 2000 ACM SIGCOMM Computer Communication Review , Proceedings of the 
conference on Applications, Technologies, Architectures, and Protocols for 
Computer Communication, Volume 30 Issue 4 

Full text available: HI pd.fT43S.48 Additional Information: full citation, abstract , references , index 
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In writing networking code, one is often faced with the task of interpreting a raw buffer according 
to a standardized packet format. This is needed, for example, when monitoring network traffic for 
specific kinds of packets, or when unmarshaling an incoming packet for protocol processing. In 
such cases, a programmer typically writes C code that understands the grammar of a packet and 
that also performs any necessary byte-order and alignment adjustments. Because of the 
complexity of certain ... 
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Christopher K. Lennard, Patrick Schaumont, Gjalt de Jong, Anssi Haverinen, Pete Hardee 
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multi-resolution visibility, progressive multi-gridding, visibility error 
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Karl Crary, Stephanie Weirich, Greg Morrisett 

September 1998 ACM SIGPLAN Notices , Proceedings of the third ACM SIGPLAN 

international conference on Functional programming, Volume 34 Issue 1 
Full text available: ®pdf(l, 27 Additional Information: full citation, abstract, references, citings, 
MB). index terms 

Intensional polymorphism, the ability to dispatch to different routines based on types at run time, 
enables a variety of advanced implementation techniques for polymorphic languages, including 
tag-free garbage collection, unboxed function arguments, polymorphic marshalling, and flattened 
data structures. To date, languages that support intensional polymorphism have required a type- 
passing (as opposed to type-erasure) interpretation where types are constructed and passed to 
polymorphic functions ... 
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Mark Stephenson, Jonathan Babb, Saman Amarasinghe 

May 2000 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 2000 conference on 

Programming language design and implementation, Volume 35 Issue 5 
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This paper introduces Bitwise, a compiler that minimizes the bitwidth the number of bits used to 
represent each operand for both integers and pointers in a program. By propagating 70 static 
information both forward and backward in the program dataflow graph, Bitwise frees the 
programmer from declaring bitwidth invariants in cases where the compiler can determine 
bitwidths automatically. Because loop instructions comprise the bulk of dynamically executed 
instructions, Bitw ... 
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This paper presents a vector generation approach for testing interconnects in configurable 
(SRAM-based) Field Programmable Gate Arrays (FPGAs). The proposed approach detects 
bridging faults and is based on quiescent current (IDDQ monitoring. Compared with previous 
voltage-based methods, EDDQ testing has the advantage of utilizing a small number of 
programming phases for configuring the FPGA during the test process w ... 

17 Session 11: seminumerical algorithms: Implementation of a portable and reproducible parallel 
pseudorandom number generator 

Daniel V. Pryor, Steven A. Cuccaro, Michael Mascagni, M. L. Robinson 

November 1994 Proceedings of the 1994 ACM/IEEE conference on Supercomputing 
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We describe in detail the parallel implementation of a family of additive lagged-Fibonacci 
pseudorandom number generators. The theoretical structure of these generators is exploited to 
preserve their well-known randomness properties and to provide a parallel system of distinct 
cycles. The algorithm presented here solves the reproducibility problem for a far larger class of 
parallel Monte Carlo applications than has been previously possible. In particular, Monte Carlo 
applications that undergo "s ... 
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This paper presents the operation of the register file in the Multiscalar architecture. The register 
file provides the appearance of a logically centralized register file, yet is implemented as 
physically decentralized register files, queues, and control logic in a Multiscalar processor. We 
address the key issues of storage, communication, and synchronization required for successful 
design and discuss the complications that arise in the face of speculation. In particular, the 
hardware requi ... 
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Data sets in large applications are often too massive to fit completely inside the computers internal 
memory. The resulting input/output communication (or I/O) between fast internal memory and 
slower external memory (such as disks) can be a major performance bottleneck. In this article we 
survey the state of the art in the design and analysis of external memory (or EM) algorithms and 
data structures, where the goal is to exploit locality in order to reduce the I/O costs. We consider a 
varie ... 
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